Long Code Arena: a Set of Benchmarks for Long-Context Code Models

Bogomolov, Egor, Eliseeva, Aleksandra, Galimzyanov, Timur, Glukhov, Evgeniy, Shapkin, Anton, Tigina, Maria, Golubev, Yaroslav, Kovrigin, Alexander, van Deursen, Arie, Izadi, Maliheh, Bryksin, Timofey

arXiv.org Artificial Intelligence

Nowadays, the fields of code and natural language processing are evolving rapidly. In particular, models are becoming better at processing long context windows -- supported context sizes have increased by orders of magnitude over the last few years. However, there is a shortage of benchmarks for code processing that go beyond a single file of context, and the most popular ones are limited to a single method. With this work, we aim to close this gap by introducing Long Code Arena, a suite of six benchmarks for code processing tasks that require project-wide context. These tasks cover different aspects of code processing: library-based code generation, CI build repair, project-level code completion, commit message generation, bug localization, and module summarization. For each task, we provide a manually verified dataset for testing, an evaluation suite, and open-source baseline solutions based on popular LLMs to showcase the usage of the dataset and to simplify adoption by other researchers.


GECKO: Generative Language Model for English, Code and Korean

Oh, Sungwoo, Kim, Donggyu

arXiv.org Artificial Intelligence

We introduce GECKO, a bilingual large language model (LLM) optimized for Korean and English, along with programming languages. GECKO is pretrained on a balanced, high-quality corpus of Korean and English using the LLaMA architecture. In this report, we share our experiences building a better data pipeline for the corpus and training the model. GECKO shows great efficiency in token generation for both Korean and English, despite its small vocabulary size. We measure performance on representative benchmarks for Korean, English, and code: the model exhibits great performance on KMMLU (Korean MMLU) and modest performance in English and code, even with fewer trained tokens than English-focused LLMs. GECKO is available to the open-source community under a permissive license. We hope our work offers a research baseline and practical insights for Korean LLM research. The model can be found at: https://huggingface.co/kifai/GECKO-7B


The Stack: 3 TB of permissively licensed source code

Kocetkov, Denis, Li, Raymond, Allal, Loubna Ben, Li, Jia, Mou, Chenghao, Ferrandis, Carlos Muñoz, Jernite, Yacine, Mitchell, Margaret, Hughes, Sean, Wolf, Thomas, Bahdanau, Dzmitry, von Werra, Leandro, de Vries, Harm

arXiv.org Artificial Intelligence

Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI)--not only for natural language processing but also for code understanding and generation. To stimulate open and responsible research on LLMs for code, we introduce The Stack, a 3.1 TB dataset consisting of permissively licensed source code in 30 programming languages. We describe how we collect the full dataset, construct a permissively licensed subset, present a data governance plan, discuss limitations, and show promising results on text2code benchmarks by training 350M-parameter decoders on different Python subsets. We find that (1) near-deduplicating the data significantly boosts performance across all experiments, and (2) it is possible to match previously reported HumanEval and MBPP performance using only permissively licensed data. We make the dataset available at https://hf.co/BigCode, provide a tool called "Am I in The Stack" (https://hf.co/spaces/bigcode/in-the-stack) for developers to search The Stack for copies of their code, and provide a process for code to be removed from the dataset by following the instructions at https://www.bigcode-project.org/docs/about/the-stack/.
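The near-deduplication finding above can be illustrated with a minimal sketch. This is not the authors' pipeline (which runs at terabyte scale, typically with MinHash-based approximations); it is a toy version of the same idea, assuming character-shingle Jaccard similarity as the notion of "near-identical":

```python
def shingles(text, k=5):
    """Set of character k-shingles of a source file."""
    return {text[i:i + k] for i in range(max(1, len(text) - k + 1))}

def jaccard(a, b):
    """Jaccard similarity of two shingle sets."""
    return len(a & b) / len(a | b) if a | b else 1.0

def near_dedup(files, threshold=0.85):
    """Keep a file only if it is not near-identical to one already kept."""
    kept = []
    for text in files:
        s = shingles(text)
        if all(jaccard(s, shingles(other)) < threshold for other in kept):
            kept.append(text)
    return kept
```

On a corpus with many copied files, a filter like this shrinks the training set while keeping one representative of each near-duplicate cluster -- the effect the paper reports as boosting downstream performance.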


What is transfer learning and why is it needed?

#artificialintelligence

In this article, we will be discussing a state-of-the-art technique for building complex deep learning models using pre-trained models. We humans have an inherent ability to transfer our knowledge across different tasks. We utilize the knowledge that we acquire from one task to solve other similar tasks. The more related the task, the easier it is for us to transfer or cross-utilize our knowledge. Let's understand this with some examples: Conventional machine learning and deep learning algorithms are designed to work in isolation.


GitHub's Commercial AI Tool Was Built From Open Source Code

#artificialintelligence

Earlier this month, Armin Ronacher, a prominent open-source developer, was experimenting with a new code-generating tool from GitHub called Copilot when it began to produce a curiously familiar stretch of code. The lines, drawn from the source code of the 1999 video game Quake III, are infamous among programmers--a combo of little tricks that add up to some pretty basic math, imprecisely. The original Quake coders knew they were hacking. "What the fuck," one commented in the code beside an especially egregious shortcut. So it was strange for Ronacher to see such code generated by Copilot, an artificial intelligence tool that is marketed to generate code that is both novel and efficient.
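The Quake III routine in question is widely known as the fast inverse square root: it reinterprets a float's bits as an integer, applies a "magic" constant, and refines the result with one Newton step. As an illustration of the trick the article alludes to, here is a Python port of the bit-level hack (the original is C; the constant 0x5F3759DF and the single Newton iteration come from the published Quake III source):

```python
import struct

def fast_inv_sqrt(x):
    """Approximate 1/sqrt(x) for positive x via the Quake III bit trick."""
    # Reinterpret the float's bits as a 32-bit integer.
    i = struct.unpack('>l', struct.pack('>f', x))[0]
    # The infamous "magic number" step: a crude log-domain estimate.
    i = 0x5F3759DF - (i >> 1)
    y = struct.unpack('>f', struct.pack('>l', i))[0]
    # One iteration of Newton's method sharpens the estimate.
    return y * (1.5 - 0.5 * x * y * y)
```

The result is only approximate -- within a fraction of a percent after the Newton step -- which is exactly the "pretty basic math, imprecisely" the article describes.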


Linux Foundation unveils new permissive license for open data collaboration - JackOfAllTechs.com

#artificialintelligence

The Linux Foundation has announced a new permissive license designed to help foster collaboration around open data for artificial intelligence (AI) and machine learning (ML) projects. It has often been said that data is the new oil, but for AI and ML projects in particular, having access to expansive and diverse data sets is key to reducing bias and building powerful models capable of all manner of intelligent tasks. To machines, data is a little like "experience" is to humans -- the more of it you have, the better decisions you are likely to make. With CDLA-Permissive-2.0, the Linux Foundation is building on its previous efforts to encourage data-sharing through licensing arrangements that clearly define how the data -- and any derivative data sets -- can and can't be used. The Linux Foundation first introduced the Community Data License Agreement (CDLA) back in 2017 to entice organizations to open up their vast pools of (underused) data to third parties.


Transfer Learning for Deep Learning: Pre-trained models to save training time and cost

#artificialintelligence

Training a neural network has long posed problems for researchers and developers. Two major problems arise during the development of a DL-based solution: the astronomical cost of training, and the time required to train the network. Since training a neural network involves numerous matrix operations and demands high computational capability, the cost of operation escalates if one needs to repeat a similar process for another model. The time to train also grows at an exponential rate as networks get deeper and more complicated. Using GPUs is one effective way to speed up the process.


Inside the 1TB ImageNet data set used to train the world's AI: Nude kids, drunken frat parties, porno stars, and more

#artificialintelligence

Special report ImageNet – a data set used to train AI systems around the world – contains photos of naked children, families on the beach, college parties, porn actresses, and more, scraped from the web to train computers without those individuals' explicit consent. The library consists of 14 million images, each placed into categories that describe what's pictured in each scene. This pairing of information – images and labels – is used to teach artificially intelligent applications to recognize things and people caught on camera. The database has been downloaded by boffins, engineers, and academics to train hundreds if not thousands of neural networks to identify stuff in photos – from assault rifles and aprons to magpies and minibuses to zebras and zucchinis, and everything in between. In 2012, the data set was used to build AlexNet, heralded as a breakthrough development in deep learning since it marked the first time a neural network outperformed traditional computational methods at object recognition in terms of accuracy.


A Gentle Introduction to Transfer Learning for Deep Learning - Machine Learning Mastery

#artificialintelligence

Transfer learning is a machine learning method where a model developed for one task is reused as the starting point for a model on a second task. It is a popular approach in deep learning, where pre-trained models are used as the starting point for computer vision and natural language processing tasks, given the vast compute and time resources required to develop neural network models for these problems and the huge jumps in skill they provide on related problems. In this post, you will discover how you can use transfer learning to speed up training and improve the performance of your deep learning model. A Gentle Introduction to Transfer Learning with Deep Learning Photo by Mike's Birds, some rights reserved.
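The reuse described above can be sketched in a few lines. This is a toy illustration, not any particular framework's API: a "pretrained" feature extractor (weights assumed to come from a source task) is frozen, and only a small new head is trained on the target task:

```python
import math

# Weights of a "pretrained" layer, learned on a source task and now frozen.
PRETRAINED_W = [[0.9, -0.2], [0.1, 0.8]]

def features(x):
    """Frozen feature extractor: its weights are reused, never updated."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in PRETRAINED_W]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_head(data, lr=0.5, epochs=200):
    """Train only a new linear head on top of the frozen features."""
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, y in data:
            f = features(x)
            err = sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b) - y
            w = [wi - lr * err * fi for wi, fi in zip(w, f)]
            b -= lr * err
    return w, b

def predict(x, w, b):
    return 1 if sigmoid(sum(wi * fi for wi, fi in zip(w, features(x))) + b) > 0.5 else 0
```

Because only the two head weights and the bias are updated, training is far cheaper than learning the full network from scratch -- the speed-up the post promises, in miniature.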